55 outlier detection #105

Maximophone · 2018-03-01T13:02:57Z

No description provided.

…dvanced graphs notebook

ukclivecox

Can we link from top level readme in a new section "components" ? to the notebook describing the outlier detector.

ukclivecox · 2018-03-01T14:28:44Z

notebooks/advanced_graphs.ipynb

@@ -25,7 +25,7 @@
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
-    "collapsed": true
+    "collapsed": false


There seems to be a missing proto compile/copy as used in other notebooks. One gets the error:
ImportError Traceback (most recent call last)
in ()
1 import requests
2 from requests.auth import HTTPBasicAuth
----> 3 from proto import prediction_pb2
4 from proto import prediction_pb2_grpc
5 import grpc

ImportError: cannot import name prediction_pb2

Did you build the protos locally first? Using the makefile in notebooks

I always add to the notebooks:

!cp ../proto/prediction.proto ./proto
!python -m grpc.tools.protoc -I. --python_out=. --grpc_python_out=. ./proto/prediction.proto

so they are self contained

ukclivecox · 2018-03-01T14:34:02Z

notebooks/advanced_graphs.ipynb

@@ -569,7 +569,7 @@
    "* Two models\n",
    "\n",
    "The outlier detector is a special kind of transformer that will populate a tag in the response metadata with the outlier score it has calculated. \n",
-    "We use the docker image seldonio/mock_outlier_detector:1.0 for the outlier detector.\n",
+    "We use the docker image seldonio/outlier_mahalanobis:0.2 for the outlier detector.\n",


Is it worth adding some explanation of the returned values from this test. Are are the outlier scores meat to be useful here?

No since the features sent are meaningless they can't really be interpreted

actually, if you always send the same 2 points, as is the case with the rest_request, you will always see an outlier score of 0

ukclivecox · 2018-03-01T14:37:45Z

examples/transformers/outlier_mahalanobis/outlier_documentation.ipynb

+   "source": [
+    "The output of the algorithm (outlier score) is a measure of distance from the center of the features distribution (Mahalanobis distance). The algorithm is online, which means that it starts without knowledge about the distribution of the features and learns as requests arrive. Consequently you should expect the output to be bad at the start and to improve over time. \n",
+    "\n",
+    "The output being a real positive number, we leave it to the user to decide on a threshold for when a point will be consider to be an outlier.\n",


typo - considered

ukclivecox · 2018-03-01T14:38:43Z

examples/transformers/outlier_mahalanobis/outlier_documentation.ipynb

+    "As observations arrive, the algorithm will:\n",
+    "- Keep track and update the mean and sample covariance matrix of the dataset\n",
+    "- Apply a principal component analysis using these moments and project the new observations on the first 3 principal components (default value, can be changed)\n",
+    "- Compute the Mahalanobis distance from this projections to the projected mean\n"


typo - projection

ukclivecox · 2018-03-01T14:41:28Z

examples/transformers/outlier_mahalanobis/outlier_documentation.ipynb

+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "To compute the outlier score of each point in the new batch, we need the inverse of the covariance matrix of all the points up to this one. This means inverting $b$ matrices. We made this operation faster by leveraging the fast that each covariance matrix is a rank one update of the previous one. \n",


typo - "the fast"

ukclivecox · 2018-03-01T14:42:24Z

wrappers-docker/Makefile

@@ -1,5 +1,5 @@
 IMAGE_NAME=docker.io/seldonio/core-python-wrapper
-IMAGE_VERSION=0.7
+IMAGE_VERSION=0.8


Have we updated all docs to version 0.8?

Not yet. These changes only impact someone who wants to build and wrap an outlier detector, but this isn't document anywhere at the moment...

Maximophone and others added 16 commits February 7, 2018 17:15

Started work on outlier detector

38b6169

More work on outlier detector

62dc609

Implementing online batch mahalanobis distance

c5f247c

Updating OD model

740b2d9

Finalised online mahalanobis distance

c720a36

Starting work on online PCA

d212c04

Implemented PCA

7999efa

Started improvement of covariance online algo

884b191

more work on new algo

7d6dab4

Further work on algo

7385e0c

Updated algorithm with non stationarity option, updated documentation

eb24240

typo

3c02dab

Modifications to wrapper

1056841

Merge branch 'master' into 55-outlier_detection

073b0ae

Updated wrapper docker version to 0.8, using outlier mahalanobis in a…

abaa724

…dvanced graphs notebook

Removed draft notebook, added tests notebook, added requirements.txt

d8f086b

Maximophone requested a review from ukclivecox March 1, 2018 13:02

ukclivecox suggested changes Mar 1, 2018

View reviewed changes

Added code to generate protos in advanced graphs notebook

2f1e416

ukclivecox approved these changes Mar 1, 2018

View reviewed changes

Maximophone added 3 commits March 2, 2018 10:38

typo

4f3a530

Update outlier_documentation.ipynb

ddd6ebb

typo

f56822c

ukclivecox merged commit f6ad032 into SeldonIO:master Mar 2, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

55 outlier detection #105

55 outlier detection #105

Maximophone commented Mar 1, 2018

ukclivecox left a comment

ukclivecox Mar 1, 2018

Maximophone Mar 1, 2018 •

edited

Loading

ukclivecox Mar 1, 2018

Maximophone Mar 1, 2018

ukclivecox Mar 1, 2018

Maximophone Mar 1, 2018

Maximophone Mar 1, 2018

ukclivecox Mar 1, 2018

ukclivecox Mar 1, 2018

ukclivecox Mar 1, 2018

ukclivecox Mar 1, 2018

Maximophone Mar 1, 2018

55 outlier detection #105

55 outlier detection #105

Conversation

Maximophone commented Mar 1, 2018

ukclivecox left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Maximophone Mar 1, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Maximophone Mar 1, 2018 •

edited

Loading